Regret Minimization in Games with Incomplete Information

ثبت نشده

چکیده

Extensive games are a powerful model of multiagent decision-making scenarioswith incomplete information. Finding a Nash equilibrium for very large instancesof these games has received a great deal of recent attention. In this paper, wedescribe a new technique for solving large games based on regret minimization.In particular, we introduce the notion of counterfactual regret, which exploits thedegree of incomplete information in an extensive game. We show howminimizingcounterfactual regret minimizes overall regret, and therefore in self-play can beused to compute a Nash equilibrium. We demonstrate this technique in the domainof poker, showing we can solve abstractions of limit Texas Hold’em with as manyas 10 states, two orders of magnitude larger than previous methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Regret Minimization in Games with Incomplete Information

متن کامل

No-Regret Learning in Repeated Bayesian Games

Recent price-of-anarchy analyses of games of complete information suggest that coarse correlated equilibria, which characterize outcomes resulting from no-regret learning dynamics, have near-optimal welfare. This work provides two main technical results that lift this conclusion to games of incomplete information, a.k.a., Bayesian games. First, near-optimal welfare in Bayesian games follows dir...

متن کامل

No-Regret Learning in Bayesian Games

متن کامل

Adaptive Regret Minimization in Bounded-Memory Games

Online learning algorithms that minimize regret provide strong guarantees in situations that involve repeatedly making decisions in an uncertain environment, e.g. a driver deciding what route to drive to work every day. While regret minimization has been extensively studied in repeated games, we study regret minimization for a richer class of games called bounded memory games. In each round of ...

متن کامل

Monte Carlo Sampling for Regret Minimization in Extensive Games

Sequential decision-making with multiple agents and imperfect information is commonly modeled as an extensive game. One efficient method for computing Nash equilibria in large, zero-sum, imperfect information games is counterfactual regret minimization (CFR). In the domain of poker, CFR has proven effective, particularly when using a domain-specific augmentation involving chance outcome samplin...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2007

Regret Minimization in Games with Incomplete Information

ثبت نشده

چکیده

منابع مشابه

Regret Minimization in Games with Incomplete Information

No-Regret Learning in Repeated Bayesian Games

No-Regret Learning in Bayesian Games

Adaptive Regret Minimization in Bounded-Memory Games

Monte Carlo Sampling for Regret Minimization in Extensive Games

عنوان ژورنال:

اشتراک گذاری